Co-evolution of density and topology in a simple model of city formation 
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We study the influence that population density and the road network have on each others' growth 
and evolution. We use a simple model of formation and evolution of city roads which reproduces 
the most important empirical features of street networks in cities. Within this framework, we 
explicitely introduce the topology of the road network and analyze how it evolves and interact with 
the evolution of population density. We show that accessibility issues -pushing individuals to get 
closer to high centrality nodes- lead to high density regions and the appearance of densely populated 
centers. In particular, this model reproduces the empirical fact that the density profile decreases 
exponentially from a core district. In this simplified model, the size of the core district depends on 
the relative importance of transportation and rent costs. 

PACS numbers: 



I. INTRODUCTION 

It has been recently estimated that more than 50% 
of the world population lives in cities and this figure is 
bound to increase Jj. The migration towards urban ar- 
eas has dictated a fast and short-term planned urban 
growth which needs to be understood and modelled in 
terms of socio-geographical contingencies, and of the gen- 
eral forces that drive the development of cities. Previous 
studies 0, H, 01 about urban morphology have mostly 
focused on various geographical, historical, and social- 
economical mechanisms that have shaped distinct urban 
areas in different ways. A recent example of these stud- 
ies can be found in [Sj] , where the authors study the pro- 
cess of self-organization of transportation networks with 
a model that takes into account revenues, costs and in- 
vestments. 

The goal of the present study is to model the coupling 
between the evolution of the transportation network and 
the population density. More precisely, the question we 
aim to answer is the following. Given the pattern of 
growth of the entire population of a given city, how is the 
local density of population changing within the bound- 
aries of the city itself, and how the road network's topol- 
ogy is modified in order to accommodate these changes? 
There are in principle a huge number of potentially rele- 
vant factors that may influence the growth and shape of 
urban settlements, first and foremost the social, econom- 
ical and geographical conditions that causes the popula- 
tion of a given city to increase in a particular moment of 
its history. We neglect in the present study this class of 
factors and consider the overall growth in the number of 
inhabitants as an exogenous variable. In order to achieve 
conclusions that have a good degree of generality, and, at 
the same time, to maintain the number of assumptions as 
limited as possible, we focus on two main features only: 



the local density of population and the structure of the 
road network. Population density and the topology of 
the network constitute two different facets of the spa- 
tial organization of a city, and from a purely qualitative 
point of view it is not hard to believe that their evolu- 
tion is strongly correlated. Indeed, Levinson, in a recent 
case study Q about the city of London in the 19*'' and 
20*'' centuries has demonstrated how the changes in pop- 
ulation density and transportation networks deployment 
are strictly and positively correlated. Obviously, the road 
network tends to evolve to better serve the changing den- 
sity of population. In turn, the road network influences 
the accessibility and governs the attractiveness of differ- 
ent zones and thus, their growth. However, attractiveness 
leads to an increase in the demand for these zones, which 
in turn will lead to an increase of prices. High prices will 
eventually limit the growth of the most desirable areas. 
It is the mutual interaction between these processes that 
we aim to model in the present work. 

Although there are many other economical mecha- 
nisms (type of land use, income variations, etc.) which 
govern the individual choice of a location for a new 'ac- 
tivity' (home, business, etc), we limit ourselves to the two 
antagonist mechanisms of accessibility and housing price. 
These loosely defined notions can be taken into account 
when translated in term of transportation and rent costs. 
We note that in the context of the structure of land use 
surrounding cities, von Thiinen already identified the 
distance to the center (a simple measure of accessibility) 
and rent prices as being the two main relevant factors. 

At first we will discuss separately the two mechanisms 
of road formation and location choice. In particular, we 
explicitely consider the shape of the network and model 
its evolution as the result of a local cost-optimization 
principle Q . In classical models used in urban economics, 
transportation costs are usually described in a very sim- 
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plified fashion in order to avoid the description of a sep- 
arate transportation industry Also, when space is 
exphcitly taken into consideration, the shape of the trans- 
portation networks is rarely considered and transporta- 
tion costs are computed according to the distance to a 
city center (as it is the case in the classical von Thiinen's 
or Dixit-Stiglitz's ^8^ models). In these approaches 
transportation networks are absent, and displacements 
of goods and individuals are assumed to take place in 
continuous space. On one side this allows for a more 
detailed description of the economical processes at play 
during the shaping of a city. On the other side, these ap- 
proaches often rely on the hypothesis that the processes 
shaping a city are slow enough to allow the balancing 
of the different forces that contribute to these processes, 
allowing as a consequence the achievement of the global 
minimum of some opportune cost function. 

The point of view inspiring our work, instead, is 
that the evolution of a city is inherently an 'out-of- 
equilibrium' process where the city evolves in time to 
adapt to continuously changing circumstances. If some 
sort of optimization or 'planning' is driving the growth, it 
has to be continuously redefined in order to take into ac- 
count the ever-changing economic and social conditions 
that are ultimately responsible for the evolution of urban 
areas. We do not, therefore, assume the optimization of 
a global cost (or utility) function. 

Finally, we would like to mention that our goal is not 
to be as realistic as possible but to consistently repro- 
duce a set of coarse grained and very general features of 
real cities under a minimal set of plausible assumptions. 
Alternative explanations might also be possible and it 
would be interesting to compare our results with those 
produced in the same spirit. We hope that this simpli- 
fied model could serve as a first step in the direction of 
designing more elaborated models. 

This paper is organized in three main parts. In the 
first part, we briefly establish the framework to describe 
the model and discuss the empirical evidences that mo- 
tivated it. In the second part, we address the issue of 
how the growth of the local density affects the growth of 
the road network. In the third part we will study how 
the road network affects the potential for density growth 
in different areas. We finally integrate all these elements 
in the fourth section, where we study the full model and 
discuss our results. 



II. THE MODEL: EMPIRICAL EVIDENCES 
AND DEFINITION 

A. Framework 

In our simplified approach, we represent cities as a col- 
lection of points scattered on a two dimensional area (a 
square of linear size L throughout this study), and con- 
nected by a urban road network. The description of the 
street network adopted here consists of a graph whose 



links represent roads, and vertices represent roads' inter- 
sections and end points. Although the primary interest 
here is on roads' networks [lol. IT^. 

it is worth mentioning that transportations net- 
works appear in variety of different contexts including 
plant /leaves morphology [13i] , rivers 16] , mammalian cir- 
culatory systems [13 , commodity delivery [l^ , and tech- 
nological infrastructures [l9|. Indeed, networks are the 
most natural and possibly simplest representation of a 
transportation system [20, IMI ■ It would be impossible 
to review, even schematically, the approaches and the in- 
sights gained in the specific fields mentioned above, but 
it is at least worth to mention a few studies which at- 
tempted to connect the evolution of networks to an opti- 
mization principle (as it is the case of the present work) . 
Maybe is it not surprising that man-made transportation 
networks have been designed with the goal to serve effi- 
ciently and cost little [l8|, but relevant examples occur 
in natural sciences as well. It is remarkable, for exam- 
ple, how the Kirchoff law, that determines the current 
in the edges of a resistor network can be derived assum- 
ing the minimization of the dissipated energy . More 
recently, optimization principles have been successfully 
applied to the study of the transportation of nutrients 
through mammalian circulatory systems in order to ex- 
plain the allometric scaling laws in biology [l3,[13|- River 
networks constitute a further example where relevant fea- 
tures of the network or gan ization can be derived from 
an optimality principle |l6l . [2^ . A last example worth 
mentioning is that of metabolic networks, where it has 
been found that specific pathways appear if conditions 
for optimal growth are assumed (see e. g. [25|). It is 
interesting to notice that there have been attempts to 
put some of the examples discussed above in the con- 
text of a single framework (see [1^). In addition, let 
us mention that there is a huge mathematical literature 
that studies optimal networks and the flow they sup- 
port; Minimal Spanning Trees |27|. S teiner Trees [28j . 
and Minimum Cost Network Flows [29j | are just three ex- 
amples. Although the present study assumes a notion 
of optimality, there are some important aspects that dif- 
ferentiate it from the works discussed above. The first 
is that the principle of optimality is at work only lo- 
cally: there is no global cost function that our road net- 
works are supposed to minimize. The second, and pos- 
sibly more important, is that we attempt to establish 
a connection between the evolution of the network and 
that of the quantity that such network is supposed to 
transport, i.e. population. Transportation networks, as 
shown from the example cited above, can generally dis- 
play a large variety of patterns. However, recent empir- 
ical studies [11 [nl [la, [M III, [11, [M [si, [11, [11 111 
have shown that roads' networks, despite the peculiar 
geographical, historical, social-economical processes that 
have shaped distinct urban areas in different ways, ex- 
hibit unexpected quantitative similarities, suggesting the 
possibility to model these systems through quite general 
and simple mechanisms. In the following subsection we 
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FIG. 1: Left: Numbers of roads versus number of nodes (ie. 
intersections and centers) for data from I IQ] (circles) and from 
(squares). In the inset, we show a zoom for a small num- 
ber of nodes. Right: Total length versus the number of nodes. 
The line is a fit which predicts a growth as \/N (data from 



present the evidences that support the previous state- 
ment. 



B. Empirical results 

The degree distribution of planar networks decays very 
fast for large degrees as it usually happens for networks 
embedded in Euclidean space or with strong physical con- 
straints which prohibits the emergence of hubs [39] . The 
degree distribution is therefore strongly peaked around 
its average (over the whole city) (fc) — 2E/N = 2e {E 
is the number of edges-the roads-and N is the number 
of nodes-the intersections). Concerning the average de- 
gree of random planar networks, little is known: For one- 
and two-dimensional lattices e = 1 and e — 2, respec- 
tively, and a classical result shows that for any planar 
network (fc) < 6, implying e < 3 (see e.g. ^^). It has 
also been recently shown that planar networks obtained 
from random Erdos-Renyi graphs over a randomly plane- 
distributed set of points upon rejection of non-planar oc- 
currences, have e > 13/7 [41J. These facts are summa- 
rized in Fig. [T] (left), together with empirical data from 
20 cities in different continents [T3|. A first important 
empirical observation is that 1.05 < Cemp < 1-69 , in a 
range lying between trees and 2d lattices, and the aver- 
age degree over all these cities is (fc) « 2.87. The strongly 
peaked degree distribution suggests that a quasi-regular 
lattice could give a fair account of the road network topol- 
ogy. This suggestion is reinforced if one considers the 
cumulative length £ of the roads. With this picture in 
mind, one would expect that for a given average density 
p = N/L'^, the typical inter distance between nodes is 



The total length is then the number of edges 



times the typical inter-distance which leads to 



(1) 



This behavior is reproduced in fig. ([T|d), where a fit of the 
empirical data in [101 with a function of the form aN^^^ 
gives aemp ~ 1-46 ± 0.04 (a fit with a function of the 
form aN'' leads to a w 1.51 ± 0.24 and r w 0.49 ± 0.03). 



The value aemp = 1-46 has to be compared with the 
average degree over all cities. One finds 2.87/2 « 1.44 
and considering statistical errors, it is hard to reject the 
hypothesis of a slightly perturbed lattice as a model for 
the road network. 

The two empirical facts above lend credibility to the 
simple picture that city streets are described by a quasi- 
regular lattice with an essentially constant degree (equal 
to approximately 3) and constant road length {ii ~ 
1/^/p). There is however a further empirical fact which 
forces us to reconsider this simple picture. The roads' 
network define a tessellations of the surface and the au- 



thors of |35[ measured the distribution of the area A of 
the polygons delimited by the edges of the network. Sur- 
prisingly, they found a power law behavior of the form 



P{A) - 



(2) 



with a ~ 1.9 (the standard error is not available in [3a|). 
This fact contradicts the simple model of an almost reg- 
ular lattice since the latter would predict a distribution 
P{A) very pea ked around a value of the order of If. The 
authors of [33] also measured the distribution of the form 
factor given by the area of a cell divided by the area of the 
circumscribed circle (for this value they use the largest 
distance D between nodes of the cell, a convention that 
we adopted): (j> — 4A/ (irD^). They found that most cells 
have a form factor between 0.3 and 0.6 indicating a large 
variety of cell shapes. 

A first challenge is therefore to design a model for pla- 
nar networks that can reproduce quantitatively these fea- 
turees and which is based on a plausible (small) set of as- 
sumptions. The simple indicators discussed above show 
that one cannot model the network by either lattices, 
Voronoi tessellation, random planar Erdos-Renyi graphs, 
all these networks having a peaked distribution of areas 
and form factors. Let's note that the scale-invariant dis- 
tribution for cell sizes can be obviously reproduced by 
assuming by the fractal model of @ which assumes a 
self-similar process of road generation. The power law 
distribution for cell sizes automatically follows from this 
assumption. In the following, we present a model that 
relies on a simple plausible mechanism, does not assume 
self-similarity and quantitatively accounts for the empir- 
ical facts presented above. 



III. THE MODEL OF ROAD FORMATION 

We first discuss the part of our model that describes 
the evolution of the road network. Our main assump- 
tion is that the network grows by trying to connect to a 
set of points -the 'centers'- in an efficient and economic 
way. These centers can represent either homes, offices or 
businesses. This parameter free model is based on a prin- 
ciple of local optimality and has been proposed in Q . For 
the sake of self-consistency and readability, we first de- 
scribe this model in detail. The application of optimality 
principles to both natural and artificial transportation 
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FIG. 2: The nearest road to the centers A and B is M. The 
road will grow to point M'. The proposed minimum expen- 
diture principle suggests that the next point M' will be such 
that the variation of the total distance to the two points A 
and B is maximal. 



networks has a long tradition [i^, [13] ■ The rationale 
to invoke a local optimality principle in this context is 
that every new road is built to connect a new location to 
the existing road network in the most efficient way [3]. 
During the evolution of the street network, the rule is im- 
plemented locally in time and space. This means that at 
each time step the road network is grown by looking only 
at the current existing neighboring sites. This reflects 
the fact that evolution histories greatly exceed the time- 
horizon of planners. The self-organized pattern of streets 
emerges as a consequence of the interplay of the geomet- 
rical disorder and the local rules of optimality. In this 
regard our model is quite different from approaches to 
transportation networks where an equilibrium situation 
is assumed and which are based on either (i) minimiza- 
tion of an average quantity {e.g. the total travel time), 
or (ii) on the inclusion of many different socio-economical 
factors ( e.g. land use). 



A. Network growth 

When new centers (such as new homes or businesses) 
appear, they need to connect to the existing road net- 
work. If at a given stage of the evolution a single new 
center is present, it is reasonable to assume that it will 
connect to the nearest point of the existing road network. 
When two or more new centers are present (as in Fig. [2]) 
and they want to connect to the same point in the net- 
work, we assume that economic considerations impose 
that a single road - from the chosen network's point - is 
built to connect both of them. In the example of figure [21 
the nearest point of the network to both new centers A 
and B is M. Wc grow a single new portion of road of 
fixed length dx from M to a new point M' in order to 
grant the maximum reduction of the cumulative distance 
of A and B from the network. This translates in the 
requirement that 

Sd = d{M, A) + d{M, B) - [d{M', A) + d{M' , B)] (3) 



is maximal {dx being fixed). A simple calculation shows 
that the maximization of dd leads to 

> 

dMM' (XUA + UB (4) 

where ua {ub) is the unitary vector from AI to A {B). 

The procedure described above is iterated until the 
road from M reaches the the line connecting A and B, 
where a singularity occurs: dMM' = 0. From there two 
independent roads to A and B need to be built to con- 
nect to the two new centers. The rule Eq. ^ can be 
easily generalized to the case of n new centers, and, in- 
terestingly, was proposed in the context of visualization 
of leafs' venation patterns p^ . 

The growth scheme described so far leads to tree-like 
structures and we implement ideas proposed in in 
order to create networks with loops. Indeed, even if tree- 
like structures are economical, they are hardly efficient: 
the length of the path along a minimum spanning tree 
network for example, scales as a power 5/4 of the Eu- 
clidean distance between the end-points |14l |. Better ac- 
cessibility is then granted if loops are present. In order 
to obtain loops, we assume, following [13j, that a center 
can affect the growth of more than one single portion of 
road per time step and can stimulate the growth from 
any point in the network which is in its relative neigh- 
borhood, a notion which has been introduced in [l5| . In 
the present context a point P in the network is in the 
relative neighborhood of a center C if the intersection 
of the circles of radius d{P, C) and centered in P and C, 
respectively, contains no other centers or point of the net- 
work [l^ . This definition rigorously captures the loosely 
defined requirement that, for v to belong to the relative 
neighborhood of s, the region between s and v must be 
empty. At a given time step, a generic center C then 
stimulates the addition of new portions of road (pointing 
to P) from all points in the network that are in its relative 
neighborood, naturally creating loops. When more than 
one center stimulates the same point P the prescription 
of (|4]) is applied and the evolution ends when the list of 
stimulated points is exhausted (We refer the interested 
reader to [7] for a detailed exposition of the algorithm). 

The formula above can be straigthforwardly extended 
to the case of centers with non-uniform weight rj. This 
leads to a modified version of Eq. @ , where the sum of 
distances to be minimized is weighted by 77 and leads to 
> 

dMM' oc riAUA + Vbub- (5) 

where r]A and t/b can be different. Simulations with non- 
uniform centers weights show that - as far as the location 
of 'heavy' and 'light' centers is uniformly distributed in 
space, uncorrelated and not broad - that the structure of 
the network is locally modified, but that its large scale 
properties are virtually unchanged. In the algorithm pre- 
sented above, once a center is reached by all the roads 
it stimulates, it becomes inactive. An interesting vari- 
ant of this model assumes that centers can stay active 
indefinetively. In this case we expect a larger effect of 
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FIG. 3: Snapshots of the network at different times of its 
evolution: for (a) t = 1,000, (b) t = 2,000, (c) t = 3,000, (d) 
i = 4, 000. At short times, we have a tree structure and loops 
appear for larger density values obtained at larger times. For 
t = 4, 000, we have approximately 1, 700 nodes connected by 
2, 000 roads. 



We will leave this problem 



the weights' heterogeneity, 
for future studies. 

In the following, we study networks resulting from the 
growth process described above. We assume that the 
appearance of new centers is given exogenously and is 
independent from the existing road network and from 
the position and number of the centers already present. 
The model accounts quantitatively for a list of descrip- 
tors -the ones discussed above in an empirical context- 
that characterize at a coarse grained level the topology 
of street patterns. At a more qualitative level, the model 
leads to the presence of perpendicular intersections, and 
also reproduces the tendency to have bended roads even 
if geographical obstacles are absent. 

We show in Fig. [3] examples of patterns obtained at dif- 
ferent times. The model gives information about the time 
evolution of the road network: at earlier times, the den- 
sity is low and the typical inter-distance between centers 
is large (see Fig.lS]). As time passes, the density increases 
and the typical length to connect a center to the exist- 
ing road network becomes shorter. Since the number of 
points grows with time, the simple assumption that the 
typical road length is given by 1/y/p leads to £i ^ 1/Vt 
which is indeed what the model predicts. 

Beyond visual similarities, the model allows quantita- 
tive comparisons with the empirical findings The ratio 
e = E/N, initially close to 1 (indicating that the corre- 
sponding network is tree-like), increases rapidly with N, 
to reach a value of order 1.25 which is in the ballpark of 
empirical findings. The cumulative length of the roads 
produced by the model (Fig. S^) shows a behavior of the 
form a-\/iV with a ~ 1.90, in good agreement with the 
empirical measurements Qemp ~ 1.87). The form factor 
distribution (Fig. has an average value cj) = 0.6 and 
values essentially contained in the interval [0.3, 0.7] in 
agreement with the results in [s^ for 20 German cities. 
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FIG. 4: Results of the model (averaged over 1000 configura- 
tions), (a) Total length of roads versus the number of nodes. 
The dotted line is a square root fit. (b) Structure factor distri- 
bution showing a good agreement with the empirical results 
of [3^. (c-d) Rescaled distributions of the perimeter (c) and of 
the areas (d) of the cells displaying an exponential behavior. 



B. Effect of the center spatial distribution 

An important feature of street networks is the large 
diversity of cell shapes and the broad distribution of cell 
areas. So far, we have assumed that centers are dis- 
tributed uniformly across the plane. Within this assump- 
tion, the model predicts a cell area distribution following 
an exponential (with a large cut-off however) as shown in 
Fig. mjd) and Fig. O The empirical distribution of ccn- 







FIG. 5: Upper left plot: Uniform distribution of points (1000 
centers, 100 configurations). In this case, the area distribu- 
tion is exponentially distributed (bottom left). Upper right 
plot: Exponential distribution of centers (5000 centers, 100 
configurations, exponential cut-off = 0.1). In this case, we 
observe a power law (bottom right). The line is a power law 
fit which gives an exponent « 1.9. 

ters in real cities, however, is not accurately described by 
an uniform distribution but decreases exponentially from 



FIG. 6: In the presence of an obstacle (here a 'river' dehmited 
by the two dotted hne) in which the centers are not allowed to 
be located, the local optimization principle leads to a natural 
solution with a small number of bridges. 



the center [3l|, 132] ■ We thus use such an exponential 
distribution P{r) = exp(— |r|/rc) for the center spatial 
location and measure the areas formed by the resulting 
network (in the last section of this point is further dis- 
cussed). Although most quantities (such as the average 
degree and the total road length) are not very sensitive 
to the center distribution, the impact on the area distri- 
bution is drastic. In Fig. [5] a power law with exponent 
equal to 1.9±0.1 is found, in remarkable agreement with 
the empirical facts reported in [s^ for the city of Dres- 
den. Although we cannot claim that this exponent is the 
same for all cities, the appearance of a power law in good 
agreement with empirical observations confirms the fact 
that the simple local optimization principle is a possible 
candidate for the main process driving the evolution of 
city street patterns. This result also demonstrates that 
the centers' distribution is crucial in the evolution pro- 
cess of a city. 

The optimization process described above has several 
interesting consequences on the global pattern of the 
street network when geographical constraints are im- 
posed, as illustrated by the following example. We sim- 
ulated the presence of a river assuming that new centers 
cannot appear on a stripe of given width (and are oth- 
erwise uniformly distributed). The resulting pattern is 
shown in Fig. [6] The local optimization principle natu- 
rally creates a small number of bridges that are roughly 
equally spaced along the river and organizes the road net- 
work. To conclude, it is worth noting that in the present 
framework we didn't attempt the modelization of plan- 
ning efforts. Simulations show that, at the present sim- 
plified stage, the presence of a skeleton of "planned" large 
roads has the effect of partitioning the plane in different 
regions where the growth of the network is dominated by 
the mechanism described above, and reproducing on a 
smaller scale the structures shown in fig. [31 



C. Hierarchical structure of the traffic 

Finally, we discuss now the presence of hierarchy in the 
network generated by the model. Indeed, geographers 



FIG. 7; Traffic map of the network. The edge centrality is 
computed and divided in three different groups and the thick- 
ness of the edge is plotted according to the group (from thin 
to thick for increasing edge centrality. 



have recognized for a long time (see e.g. [2|) that many 
systems are organized in a hierarchical fashion. High- 
ways are connected to intermediate roads which in turn 
dispatch the traffic through smaller roads at smaller spa- 
tial scales. In order to test for the existence of such a 
hierarchy in our model, we use the edge betweenness 
centrality as a simple proxy for the traffic on the road 
network. For a generic graph, the betweenness centrality 
g{e) of an edge e [H, \^ [H^ is the fraction of shortest 
paths between any pair of nodes in the network that go 
through e. Allowing the possibility of multiple shortest 
paths between two points, one has 



Ke) = E 



(6) 



where ast is the number of shortest paths going from s to 
t and (Tgt[e) is the number of shortest paths going from 
s to t and passing through e. Central edges are therefore 
those that are most frequently visited if shortest paths 
are chosen to move from and to arbitrary points. We 
computed this quantity for all edges of the road network 
generated by our algorithm. It appears that this quan- 
tity g{e) is broadly distributed and varies over more than 
6 orders of magnitude. In order to get a simple represen- 
tation of this quantity we arbitrarily group the edges in 
three classes: [1,10''], [10'', 10^], [10^oo] and plot them 
with different thicknesses. In particular, we see in Fig [71 
that edges with the largest centrality (represented by the 
thickest line) form almost a tree of large arteries. Propor- 
tionality between traffic and edge-centrality, as defined 
above, is virtually equivalent to assuming: i) a uniform 
origin-destination matrix, ii) everybody choose the short- 
est path to reach a destination, and Hi) roads are "large" 
enough to support the traffic generated hy i) and ii) with- 
out congestion effects. Under these assumption one in- 
deed observes a hierarchy of smaller roads and streets 
with a decreasing typical length and the existence of a 
hierarchical structure of arteries, roads and streets. 
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IV. LOCATION OF CENTERS: EFFECT OF 
DENSITY AND ACCESSIBILITY 

In the simple version of the model presented above, 
the location of centers are independent from the topol- 
ogy of the road network. In real urban systems, this is 
however unlikely to happen. There is an extensive spa- 
tial economics literature (see Q and references therein) 
that focuses on the several factors that may potentially 
influence the choice location for new businesses, homes, 
factories, or offices (see also |45] and references therein). 
Empirical evidences suggest a strong correlation between 
transportation networks and density increase have been 
recently provided by Levinson Our goal here is to 
discuss, based on very simple and plausible assumptions, 
the coupled evolution of the road network and the pop- 
ulation density. 

We first divide the city in square sectors of area S, and 
we assume that the choice of a location for a new center 
is governed by a probability P{i) that one (i) of these 
sectors is chosen. This probability, which reflects the at- 
tractiveness of a location, depends a priori on a large 
number of factors such as accessibility, renting prices, in- 
come distribution, number and quality of schools, shops, 
etc. A key observation made by previous authors (sec for 
example p[6j and references therein) is that commuting 
cost differences must be balanced by differences in living 
spaces prices. We will follow this observation and we will 
thus focus on two factors which are the rent price and the 
accessibility (which we will reduce to commuting costs). 



A. Rent price and accessibility 

The housing price of a given location is probably deter- 
mined by many factors comprising tax policies, demog- 
raphy, etc., and i s by itself an important subject of study 
(see for example [47| ) . We will make here the simplifying 
assumption that the rent price is an increasing function 
of the local density (for each grid sector) p{i) — N{i)/S, 
where N{i) is the number of centers in the sector (i), 
and in particular that the rent price Cr is directly pro- 
portional to the local density of population (which can 
be seen as the first term of an expansion of the price as 
a function of the density) 



(7) 



where A is some positive prefactor corresponding to the 
price per density. We note here that a more general form 
of the type Cr — Ap{iY could be used. A preliminary 
study suggests that as long as the rent cost is an increas- 
ing function of the density, our results remain qualita- 
tively unchanged. It would be however very interesting 
to measure this function empirically. 

The second important factor for the choice of a location 
is its accessibility. Locations which are easily accessible 
and which allow to reach easily arbitrary destinations 



are more attractive, all other parameters being equal. 
Also, for a new commercial activity, high traffic areas 
can strongly enhance profit opportunities. In terms of 
the existing network, the best locations are therefore the 
most central and standard models of city formation (see 
for example [3|) indeed integrate the distance to the cen- 
ter and its associated (commuting) cost as a main factor. 
Euclidean distance, however, can be a poor estimator of 
the effective accessibility of a given location, if this loca- 
tion is poorly connected to the transportation network. 
This is why the notion of centrality has to be considered 
not only in geographical terms, but also from the point of 
view of the network that grants mobility. The possibility 
to easily reach an arbitrary location when movement is 
constrained by a network is nicely captured in quantita- 
tive terms by the notion of node betweenness centrality. 

In the previous section, we defined the edge between- 
ness centrality and here we need a similar quantity de- 
fined for nodes rather than for edges. The node between- 
ness centrality g{v) [H, [sO] of a node v is defined as 
the fraction of shortest paths between any pair of points 
in the network that go through v. The mathematical 
expression of this quantity is then 



1 



N{N -1) 



St 



(8) 



where ast is the number of shortest paths going from s to 
t and ast (v) is the number of shortest paths going from s 
to t and passing through v. Betweenness centrality was 
initially introduced as a natural substitute for geometric 
centrality in graphs that are not embedded in Euclidean 
space (see fig. [S]). 




oooooooooo 
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FIG. 8; Illustration of the notion of node betweenness cen- 
trality for a non planar graph 

Betweenness also naturally serves our purpose to quan- 
tify accessibility on planar graphs, especially in our sim- 
plified framework where an explicit distinction between 
resources and users has been sacrified to the sake of sim- 
plicity. It is important to note that betweenness central- 
ity, on planar graphs, is strictly correlated to other, more 
common, measures of centrality. The two first panels of 
fig[9]show the contour and the 3D plot of node-betweeness 
for a Manhattan-like grid of 25 blocks per side and clearly 
show how central nodes are those that are the most fre- 
quently visited if shortest paths are chosen to move from 
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and to arbitrary points. The third panel of the same fig- 
ure shows the relation between the betweenness of a node 
and its average distance from all other nodes (along the 
same square grid) . This plot demonstrates that the larger 
the betweenness of a node is, the shorter is its average 
distance from a generic node. The betweenness thus ex- 





Average Distance 

FIG. 9; Top panel: Contour plot of node betweenness cen- 
trality -not normalized-for a square grid of linear size 25 (A 
light color indicates a high value of the betweenness) . Middle 
panel: 3D plot of node betweenness centrality (not normal- 
ized) for the same square grid. Bottom panel: Plot of the 
node betweenness of a generic node vs its average distance 
from all the other nodes. 



tends the concept of geographical centrality to networks 
whose structure is not a lattice or planar. 

In our simple model, we will assume that accessibility 
reduces here to the facility of reaching quickly any other 
location in the network. This can also be seen as the av- 
erage commuting cost which in previous models was 
assumed to be proportional to the distance to the center. 
The natural extension for a network is then to take the 
transportation cost depending on the betweenness cen- 
trality. For each sector Si of the grid, we first compute 



the average betweenness centrality as 



(9) 



where the bar represents the average over all nodes (cen- 
ters and intersections) which belong in a given sector. 

Transportation costs are a decreasing function of the 
betweenness centrality and we will assume here that the 
transportation cost Crii) for a center in sector {i) is given 

by 



CT{^ = B{g„,-g{^)) 



(10) 



where B and gm are positive constants (other choices, as 
long as the cost decreases with centrality, linearly or not, 
give similar qualitative results). 

Finally, we will assume (as it is frequently done in 
many models, see for example (46j and references therein) 
that all new centers have the same income Y{c) — Y. 
This assumption is certainly a rough approximation, as 
demonstrated by effects such as urban segregation, but 
in order to not overburden our model, we will neglect in- 
come disparities in the present study. The net income of 
a new center c in a sector (i) is then 



(11) 



The higher the net income K(i) and the more likely the 
location (i) will be chosen for the implantation of a new 
business, home, etc. In urban economics the location is 
usually chosen by minimizing costs, and we relax this 
assumption by defining the probability that a new center 
will choose the sector (i) as its new location under the 
form 



P(z) 



This expression rewritten as 



J2 ■ ef^(^9U)-pU)) 



(12) 



(13) 



where PA is redefined as (3 and where A ~ B/A. For 
numerical simulations, the local density is normalized by 
the global density N/L'^, in order to have the density and 
centrality contributions defined in the same interval [0, 1]. 
The relative weight between centrality and density is then 
described by A and the parameter /3 implicitly describes 
in an 'effective' way all the factors (which could include 
anything from individual taste to the presence of schools, 
malls, etc) that have not been explicitly taken into ac- 
count, and that may potentially influence the choice of 
location. If /3 « 0, cost is irrelevant and new centers will 
appear uniformly distributed across the different sectors: 



P(^) 



1 



(14) 
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In the opposite case, (3 oo, the location with the min- 
imal cost will be chosen deterministically. 



P{i) = 1 for i such that K{i) is minimum 
P{i) — for all other sectors 



(15) 



The parameter /3 can thus be used in order to adjust 
the importance of the cost relative to that of other factors 
not explicitly included in the model. 



V. CO-EVOLUTION OF THE NETWORK AND 
THE DENSITY 

We finally have all the ingredients needed to simulate 
the simultaneous evolution of the population density and 
the road network. Before introducing the full model, 
analogously to what we have done for the first part of 
the model, it is worth to study this second part sepa- 
rately. To do that, we consider a toy -one dimensional- 
case, where the network plays no role, since a single path 
only exists between each pair of nodes. Despite the sim- 
plicity of the setting, it is possible to draw some general 
conclusion. 



A. One-dimensional model 

We assume that the centers are located on a one- 
dimensional segment [—L,L]. Since only a single path 
exists between any two points, the calculation of central- 
ity is trivial. In the continuous limit, and for a generic 
location x it can be written as the product of the num- 
ber of points that lie at the right and left of the given 
location 



piy,t)dy 



N - 



p{y,t)dy 



(16) 



where p{x, t) is the density at x. The equation for the 
density therefore reads: 



dtp{x, t) = e 



/3 A 



£. P(v.t)dy (N-f^j^ p(y,t)dy) p(^,t) 



(17) 



where TV = J'_^ p(y, t)dy. The numerical integration of 
Eq. pT|) shows that, after a transient regime, the process 
locks in a pattern of growth in which the total population 
grows at a constant rate 



N= I ^p{y,t)dy(xt. 



(18) 



This suggests that a solution for large t can be found via 
the separation of variables under the form 



pix,t) = af{x)t 



(19) 



where one can set f{x)dx = 1 without loss of gen- 
erality. Plugging the expression into Eq. (fT7|) one 
gets 



af{x) 



J[\J1^ f{y)dv{i-Jl^ f(y)dv)-f(x)] 



(20) 



where a is an integration constant to be determined. An 
explicit solution for the inverse f~^{x) can be achieved 
via the Lambert function (Lambert's function is the prin- 
cipal branch of the inverse oi z = wexp'"), but the ex- 
pression is not particularly illuminating and it is therefore 
not presented here. Several facts can however be under- 
stood using a direct numerical integration of Eq. p7p or 
the simulation of the relative stochastic process: 

• At large times, population in different location 
grows with a rate f{x) that depends on the loca- 
tion but not explicitely on time. This is a direct 
consequence of Eq. pop and is obviously a different 
behavior from uniform growth. The ratio of popu- 
lation density in two different locations xi and X2 
stabihzes in the long run to f{x{)/f[x2) 

• Although /3 models the 'noise' in the choice of lo- 
cation and A the relative importance of centrality 
as compared to density, they have similar effects 
on the expected density in a given location. An 
increase in (3 and A corresponds a concentration of 
density in the areas of large centrality and a steeper 
decay of density towards the periphery, as shown 
in fig. llOb ,. This can intuitively be understood by 
looking e.g. at the role played by the parameter /3 
in Eq. PH)) : as (3 decreases to 0, the differences in 
rate of growth in different locations becomes negli- 
gible 

• Eq. pT|) describes the average (or expected) behav- 
ior of the population density over time. Numerical 
simulations of the corresponding stochastic process 
show fluctuations from the above mentioned ex- 
pected value. Such fluctuations increases as noise 
increases (ie. when (3 decreases). 



• Numerical integration of Eq. p7|) suggests that, as 
A increases, the decay of density assumes a power 
law form whose exponent depends on (3 and A and 
approaches —1 as A gets very large. This can be 
explained by assuming f{x) ~ , and using 

Eq. (PO]) and its derivative both computed in L. 
The derivative of f{x) is: 



fix) = f3Xf{x) 1 



f{y)dy fix). (21) 



The above expression can be computed hi x = L, 
taking into account that that J'_^^ f{x)dx = 1 and 
the assumed algebraic functional form of f{x). This 
leads to 



7rL~i = 7/3L"''(A7 - r-fL-^). 



(22) 



In the limit of large L one can keep only the leading 
orders in L, and match the power and the coeffi- 
cient of the leading order on the two sides of the 




FIG. 10: The stationary growth rate for different values of the 
parameters, (a) Large values of /3 and A implies larger degree 
of centralization and a faster decay of density from center 
to periphery, (b) At large values of A the decay of density 
becomes algebraic for location away from the center. The 
exponent approaches —1 and f{x) is approximated in that 
region by l/(f3\/x). 
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FIG. 11: Networks obtained for different values of A (and for 
iV = 500 and 13=1). On the left, A = and only the density 
plays a role and we obtain a uniform distribution of centers. 
On the right, we show the network obtained for A = 8. In 
this case, the centrality is the most important factor leading 
to a fe^ ' 



equation above. This gives r = 1 and 7 = l/(/?A). 
The validity of this argument can be verified look- 
ing at fig. [TUb . where ln(/) is plotted vs ln(x) to 
highhght the power law behavior of f{x) and where 
the line l/{(3Xx) has been plotted as a reference for 
the case /3 = 10 and A = 10. 

This simple one-dimensional model thus allowed us to 
understand some basic features of the model that will be 
discussed in their full generality in the next section. 




B. Two-dimensional case: existence of a localized 
regime 

We now apply the probability in Eq. (fT3|) to the growth 
model described in the first part of this paper. The pro- 
cess starts with a 'seed' population settlement (few cen- 
ters distributed over a small area) and a small network of 
roads that connects them. At any stage, the density and 
the betweenness centrality of all different subareas are 
computed, and a few new centers are introduced. Their 
location in the existing subareas is determined accord- 
ing to the probability defined in Eq. (|13|) . Roads are 
then grown until the centers that just entered the scene 
are connected to the existing network. This process is 
iterated until the desired number of centers has been in- 
troduced and connected. In the two panels of figure [TT] 
we show the emergent pattern of roads that is obtained 
when A is small and very large, respectively. 

When A is small the density plays the dominant role 
in determining the location of new centers. New centers 
appear preferably where density is small, smoothing out 
the eventual fluctuations in density that may occur by 
chance and the resulting density is uniform. On the other 
hand, when A is very large, centrality plays the key role, 
leading to a city where all centers are located in the same 
small area. The centrality has thus an effect opposite to 
that of density and tends to favor concentration. We will 
now describe in more details the transition between the 
two regimes described above. 



FIG. 12: Fraction of dominating sectors (obtained for 500 
centers and averaged over 100 configurations). When A is 
small, the center distribution is more uniform and o is large 
(close to 100%). When A increases, we see the appearance of 
a few sectors dominating and concentrating most of the pop- 
ulation. This effect is smoothen out for smaller values of /3 
corresponding to the possibility of choice. 



We compute, in the two cases, the following quantity 
(previously introduced in a different context 51, 52]): 



N 



(23) 



where the sum runs over all sectors which number is Ng. 
In the uniform case, all the N{i) are approximately equal 
and one obtains I2 ^ which is usually small. In 

contrast, when most of the population concentrates in 
just a few sectors which represent a finite fraction of the 
total population, we obtain Y2 ^ 1/n where n represents 
the order of magnitude of these highly-populated sectors- 
the 'dominating sectors'. The quantity 



(24) 



YoNs 



gives therefore the fraction of dominating sectors. The 
behavior of a vs. A is shown in Fig. [T^l We observe that 
a decreases very fast when A increases, signaling that a 
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phenomenon of localization sets in as soon as transporta- 
tion costs are involved. 

We conclude this section discussing the role played by 
the parameter (3. Analogously to what happens in the 
one dimensional case, the concentration effect is weak- 
ened by a small values of (3. The parameter (3 describes 
the overall importance of the cost-factors with respect 
to other factors that have not been explicitly taken into 
account, or, equivalently, the possibility of choice. In- 
deed, when /3 is very large, the location which maximizes 
the cost is chosen. In contrast, when the parameter (3 
is small, the cost differences are smoothen out and a 
broader range of choices is available for new settlements. 
FigurefT2l illustrates the importance of choice. In particu- 
lar, the appearance of large-density zones (controlled by 
the importance of transportation accessibility) is coun- 
terbalanced by the possibility of choice and the resulting 
pattern is more uniform. 

C. Density profile: the appearance of core districts 




3 1 , \ , \ , \ , \ , 1 

0.2 0.4 0.6 0.8 1 

r 

FIG. 13: Density profiles for A = 8 (TV = 200, averaged over 
500 configurations). The decay of the density profile is well 
fitted by an exponential, signalling the appearance of a well- 
defined core district (error bars represent one standard devi- 
ation) . 



In this last part, we describe the effect of the interplay 
of transportation and rent costs on the decay of popula- 
tion density from the city center. In the following, the 
core district is identified as the sector with the largest 
density. The whole plane is then divided in concentric 
shells with internal radius r and width dr. The density 
profile p{r) is given by the ratio of the number 5n of 
centers in a shell to its surface (55 (r) 

P{r) = ^ (25) 

For small A, the density is uniform, as expected. In fig- 
ure [13] we show the density profile p(r) in the case of 
A large, where we observe a fast exponential decay of 
the form exp— r/rc, in agreement with empirical obser- 
vations [3l|. This behavior is the signature of the ap- 
pearance of a well-defined core district of typical size Tc, 
whose typical size Tc decreases with A. This simplified 
model predicts, therefore, the existence of a highly pop- 
ulated central area whose size can be estimated in terms 
of the relative importance of transport and rent costs. 

VI. DISCUSSION AND PERSPECTIVES 

We presented a basic model that describes the impact 
of economical mechanisms on the evolution of the popu- 
lation density and the topology of the road network. The 
interplay between rent costs and demand for accessibil- 
ity leads to a transition in the population spatial density. 
When transportation costs are moderate, the density is 
approximately uniform and the road network is a typical 
planar network that does not show any strong hetero- 
geneity. In contrast, if transportation costs are higher, 
we observe the appearance of a very densely populated 
area around which the density decays exponentially, in 



agreement with previous empirical findings. The model 
also predicts that the demand for accessibility easily pre- 
vails on the disincentive constituted by high rent costs. 

A very important ingredient in modeling the evolution 
of a city is how individuals choose the location for a new 
business or a new home. We isolated in this work the two 
important factors of rent price and transportations costs. 
For these costs, we assume some reasonable forms but it 
is clear that large scale empirical measures are needed. 
In particular, it would be interesting to characterize em- 
pirically how the rent price varies with the density and 
how transportation costs varies with the centrality. Pos- 
sible outcomes to these studies would be to give an idea 
of the value of the parameter A (and possibly also (3) and 
thus to determine how much the city is centralized. 

As it happens in every modeling effort, a satisfac- 
tory compromise between realism and feasibility must 
be found and we opted for sacrificing some important 
economic considerations in order to be able to explicitly 
take into account the topology of the road transporta- 
tion network and not the distance to a center only, as it 
is usually assumed in most models. Our model predicts 
so far the appearance of a core center, but it is however 
known that cities present a large diversity in their struc- 
ture, ranging from a monocentric organization to differ- 
ent levels of polycentrism. In addition, interesting scal- 
ing relations between different parameters (total wages, 
walking speed, total traveled length, etc) and population 
size were recently found [HI, [IJ showing that beyond the 
apparent diversity, there are some fundamental processes 
driving the evolution of a city. 

These different results appear as various facets of the 
process of city formation and evolution and it is at this 
stage not clear how to connect the scaling to the struc- 
tural organization of a city and more generally, how to 
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reconciliate the different existing results in a unified pic- 
ture. We believe that the present model- modified or 
generalized- could help for future studies in this direc- 
tion. 
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